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The suggestion for using such a scheme was first given by Kelley 
[4J who used performance index-to-go as the index variable. He called this 
a "transversal comparison" scheme. Time-to-go has the advantage that it 
ilways decreases monoton I cally whereas this is not always true o f perf nr 
index-to-go. A monotonically changing index variable must be used if the 
transversal comparisons are to be made over the entire flight. The trans- 
versal comparison is used here in an iterative scheme to predict the time 
to-go. Kelley's suggestion is also extended to include non-stationary 
systems and in-flight changes in the terminal constraints. 
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ABSTRACT 

A modification of the perturbation feedback control scheme of Refs, [lj , 
[ 2 ] , and [3] is presented that greatly increases its capability to handle dis- 
turbances in cases where the final time is not specified. The modified control 
scheme uses a set of precalculated gains which allows in-flight estimation of 
the change in the final time due to perturbations from a nominal path. The 
time-to-go, determined from the predicted change in final time, is used to 
enter tables of precalculated feedback control gains. 

This modified guidance scheme is applied to a re-entry glider entering 
the atmosphere of the Earth at supercircular velocities. Beginning at the 
bottom of the pull-up maneuver (nominal altitude 188,000 ft.* nominal velocity 
33,000 ft. /sec. the glider is guided to a terminal altitude of 220,000 ft. 
and zero (0) flight path angle with maximum terminal velocity. For initial 
altitudes between 167,000 and 216,000 ft. the terminal error in altitude is 
less than two feet; for initial velocities between 23,000 ft. /sec. and 43,000 
ft. /sec. the terminal altitude error is less than 13 ft. In addition, the 
terminal velocity is very close to optimal for these initial conditions. 
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1. INTRODUCTION 


Among the many feedback control schemes valid for small perturbations 
about a nominal path is the neighboring optimum control scheme [l] , [2]. 

This scheme generates a path neighboring to a nominal optimal path which 
minimizes the performance index to second order while meeting the terminal 
constraints. The control perturbation from the nominal control is a linear 
combination of the state variable perturbations weighted by a precalculated 
set of gains. The gains are determined by solving what is called the 
"accessory minimum problem" in the calculus of variations. Where more than 
one terminal condition is specified, some of the gains become infinite at 
the nominal terminal time. 

The precalculated gains are conveniently stored as a function of 
time. Due to unforeseen disturbances, the system may not pass through the 
nominal initial state at the nominal initial time. The optimal path from 
this neighboring initial state may very well intersect the terminal manifold 
at a time later than the nominal terminal time; if clock time is used to 
enter the gain tables, the gains would become unbounded before reaching the 
terminal manifold. This difficulty is avoided if the gain tables are entered 
at an "index time" determined, so that the time-to-go on the neighboring 
path is the same as the time-to-go for the nominal path, i.e., 

(t f - t) = (t^ - tp = time-to-go (1) 

N 

where t f is the estimated terminal time of the neighboring path, t f is 
the terminal time of the nominal path, t is the clock time and tj is 
the index time. As the time-to-go goes to zero the gains go to infinity 



at the terminal time of the neighboring path. The time-to-go can be esti- 
mated from inflight conditions by using (1) as 



N 

The change in the final time, t^ - t^, can be calculated in terms of the 
deviation in the state variables from their nominal values using an additional 
set of precalculated gains (see e.g.. Ref. 2). This additional set of gains, 
while adding some complexity, allows the control scheme to meet the terminal 
constraints and achieve the minimum value of the performance index with much 
greater perturbations from the nominal state. 

2. GENERAL DESCRIPTION OF CONTROL SCHEME 

The system to be controlled is described by a set of first-order non- 
linear differential equations 

x * f(x,u,t) (3) 

where x is a column vector of n state variables, f is a column vector 

of n known functions, u is a control vector of m control variables, t 

is the independent variable time and (*) = d( )/dt. 

N 

Control programs, u (t), and associated state variable programs, 
x N (t), are precalculated to produce a path starting from a given initial 
point x(t Q ) that minimizes a performance index 

J - i|»[x(t f ), t f ] 


while satisfying terminal constants 


*[x(t f ), t f ] = o 


(5) 


where tj> is a column vector of q known functions (q £ n-1) , and t^ 
is the (unspecified) terminal value of the independent variable. 

For small variations in the state from the nominal state, a control 
program can be found which generates an optimum path neighboring to the 
nominal path. The required control perturbations, as shown in Refs, [l] 
and [ 2 ] , may be obtained by multiplying the state deviations by a precal- 
culated gain matrix, A(t): 

u (t) - u N (t) - -A(t)[x(t) - x N (t)] . (6) 

To meet the constraints (5), some elements of the gain matrix, A(t), may 
become infinite at the terminal time. If the neighboring path reaches the 
terminal manifold later than the nominal path, the control variation of (6) 
becomes undefined. 

To avoid this difficulty it is suggested that the gain tables be 
entered at an index time determined so that the time-to-go on the nominal 
path is the same as the time-to-go on the neighboring path as expressed in 
(1) . The inflight estimate of the change in the terminal time (2) is found 
in terms of the deviations in the state variables from the nominal values 
weighted by a set of precalculated gains, K(t),(see Section 3 or Ref. [ 2 ]), as 

t f - t^ = t - tj * K(t)[x(t) - x N (t)] . (7) 

N 

Since K(t) , as well as A (t) , are calculated along the nominal, t f - t^ 

N 

is undefined for values of t greater than t f . However, instead of (7) an 
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implicit equation for the index time is found by using the first-order approx- 
imation 


[x(t) - x N (t)] - [x(t) - x N (t I >] - x N (t x )[t - tj (8) 

in (7) to obtain 

K(t ) [x (t ) - x N (t )] 

t - t - — I .- N -S- (9) 

1 + K(t I )x W (t I ) 

If t and the current state vector, x(t), are known, the index time can 

be determined rapidly by successive substitutions into (9), using tables of 

KUjJ/ll + K(t I )x N (t I )] and x N (t I ). 

Using Eqs. (6), (8), and u N (t) - u N (tj) + u N (t (t-tj) , the control 

N N 

u(t) may be expressed in terms of u (t^.) , x (t , t^ and feedback control 
gains, A(tj) : 

o(t) - u N (t t ) - A(c t )[x(t) - x N (tj)] 

+ [t - tj (10) 

Note, x(t) is the current estimate of the state vector and t is the 
clock time. If the system being controlled is stationary, that is, if the 
equations of motion and the boundary conditions are not explicitly dependent 
on time, it will be shown that 

[u N + A(t I )x N (t I >] - 0 (11) 

Thus the control for a neighboring path for the stationary case is independent 
of the time variation t - t^: 

u(t) - u N (tj) - A (tj) [x(t) - x N (tj)] ( 12 ) 
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In the following section A(t) and K(t) are found by solving the 
accessory minimum problem in the calculus of variations. The iterative 
procedure for finding tj will be eliminated by developing a gain associated 
with (t-tj). Also the gain matrix associated with (t-tj) in (10) is 
calculated through numerical integration. The need for numerical differen- 
tiation to find i M (t) is eliminated. 

3. THE ACCESSORY MINIMUM PROBLEM 

The feedback gains used to estimate small changes in the control 
variable and final time are found by solving the accessory minimum problem 
(see e.g., Ref. [ 2 ] or [ 5 ]). About a stationary path (a path satisfying 
first-order necessary conditions) the performance index expanded to second 
order is minimized subject to linear dynamics. The state space for this 
accessory problem is composed of the deviations in the state variables 
awav from the nominal values. 

The augmented performance index for the general problem stated in 

the last section is defined as 

t f 

J ■ * + | [H(x,u,X,t) - X^x] dt (13) 

t 

o 

where 

* - $[x(t f ), tj] + v T i|>[x(t f ), t^ (14) 

H - X T f[x(t) ,u,t] (15) 

Here, X and v are vector Lagrange multipliers associated with (3), and 
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(5), respectively. The initial and final times are t Q and t^. The 
Euler-Lagrange necessary conditions are 


X - -H 


H - 0 

u 


The terminal conditions are 


X(t { ) - t x 


■ H + ■ 0 


where ( ) means 3( )/3x. 


In the accessory minimum problem, 6u(t) is to be determined which 
minimizes the second-order expansion of the augmented performance index (13) 


S R m <Sx 


d 2 j - Q n d u ( + i J *[*W| “ xu 

I m T n T a dt I *- l""* 


subject to 


6x ■ f 4x + f 6u 
x u 


6x(t Q ) specified 


dt|> ■ where dtp is specified 


where the deviation in any variable Q la defined as 


6Q - Q(t) - Q”(t) 
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The weighting matrix in (20) is 



( 25 ) 


Note that this matrix is symmetric. 

First-order necessary conditions for the accessory minimum problem 
may be developed as follows. Define the variational Hamiltonian for this 
accessory problem as 



(26) 


the necessary conditions are 


6x - UH) 6x (27) 

6X - -UH) 6x (28) 

0 - (4H) Ju <29) 

From (29) the perturbed control is given in terms of <$x and 6X as 

6u - -H _1 [h 6x + H , 6x1 (30) 

uu L ux uX J 


where H uu is assumed to be non-singular. 

In order to develop a feedback law from (30) and predict the change 
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in terminal time from present state deviations, a linear relation among 
the variables fix, dv, dt^, 5A, di|< and dfl, developed in Ref. [ 2 ], is 
presented here as 


IfiHt)] 


fS(t) R(t) m(t)l [fix ( t ) 


R 1 (t) Q(t) n(t) dv 


lm T (t) n T (t) a (t)J Idt. 


The elements of the coefficient matrix of (31) are called "sweep" variables 
and are determined by a symmetric but non-linear differential equation 
(developed in Ref . \l \ ) : 


T 

R Q 

T T 
m n 


-c V 


-A B 


I 0 
n q 


where 


f - f H _1 H 
X u uu ux 

-1 T 

f h r 

u uu u 

H - H H -1 H 
XX XU uu ux 


The boundary conditions are given by (25) 


4. PROPERTIES OF SWEEP VARIABLES 


The differential equations of (32) are integrated backwards from 
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the terminal manifold, stopping at the initial time. If there are no 
conjugate points (S i 1 and the path is normal (Q < 0 for a minimum) 
as defined in Ref. [ 5 ], then (30) can be manipulated into the form (on 
an optimal path = 0 *■♦ dft ** 0) 


<5X (t)~ 


s 



dv 

m 

ft T 

A 

Q 


fix(t) 

- dt f- 


aT 

aT 




_ra 

n _ 



(36) 


where 



(37) 

(38) 

(39) 

(40) 

(41) 


It should be noted that these new variables, ("), still satisfy (32). This 
transformation is desirable in order that the perturbed control of (30) be 
written as a feedback law operating on the deviations of the state variables. 
When integrating (32), this transformation must be taken in a time interval 
which lies some time before the terminal boundary (corresponding to Q close 
to zero) and some time after a conjugate point defined for an arc with no 
terminal constraints on the state space but with fixed terminal time (this 
corresponds to S becoming unbounded) . For the atmospheric re-entry guidance 
problem to be given in Section 7, S does go to «° although S remains 
bounded (also see Appendix C) . 
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5. FEEDBACK GAINS 


The feedback gains K(t) and a ( t) can now be identified with 
the sweep variables. From (36) the general form for predicting the change 
in the terminal time is 

dt^ ■ m T (t)6x(t) + n T di|» (42) 

where m T = K(t). This form has the additional flexibility of allowing 
small changes in the terminal constraints. 

Instead of developing the implicit equation as (9) let us suppose 
that some estimate of t-tj was made. If the system has undergone a pertur- 
bation then (2) does not hold but differs by some error, e, as 

(t-t 1 ) NEW - (t-tj) + c . (43) 

NEW 

If (43) is introduced into (42) along with (8) and (t - tj) » dt f an explicit 
equation can be obtained for calculating e: 

e - m T (tj)[x(t) - x(tj)] - KtjXt-tj) + n T (t I )di|; (44) 

where 

i(t x ) - 1 + m T (t I )f N (t I ) (45) 

Note that in (44) second-order terms are neglected. Equations (43) -(45) 
are used iteratively until c ^ 0. For the numerical problem to be discussed, 
c was very small within two iterations. 

The index time that results from this iterative procedure is used 
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to enter the gain tables from which the perturbed control variable is cal- 
culated. If the linear expression for 6X(t) in (36) is substituted into 
(30), the perturbed control is 

6u ** -A(t)6x(t) - B(t)di|> (46) 

where 

ACt) - H"i<<0[H ox (t) + H uX (t)S(t)] (47) 

6(t) • H"J(t)H uJ (c)R(t) (48) 

The control program on the neighboring path can be written as a function 
of index time with the help of (8) : 

u(t) - u N (t x ) -A(t x )[x(t) - x\ T )] + rCt^Lt-tj] - p(t I )dtp (49) 

where 

r(t x ) - + u N (t x ) (50) 

The control program also allows for variations in the terminal conditions. 

Second -order terms in (49) are neglected. 

For systems which are stationary the gains l and r are zero. In 
Appendix A the explicit dependence of these gains on time is demonstrated. 


6. ANALYTIC EXAMPLE OF CONTROL SCHEME 

The following example is chosen to illustrate some of the features 
of the previous section for non-stationary problems. The problem is to reach 
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a parabolic boundary in x,t space so that a combination of terminal time 
and integral squared velocity is minimized. This might be Interpreted as 
a one-dimensional pursuit problem in which the target has a constant accel- 
eration. 

The problem statement is: find the control variable, u, that minimizes 


J 


j f (l+u 2 )dt 
0 


(51) 


with dynamical equations 


(52) 


and the terminal boundary 

ip ■ Xj + t» - 1 ■ 0 (53) 

The augmented performance index is 

( C f 

J ■ $ + [H - Xx]dt (54) 

0 

where 

* - v(t f +|t 2 -l) (55) 

H ■ 1 + y u 2 + Xu (56) 

First-order necessary conditions are 

X * u ; X (t.) ■ v (57) 

H - 0 — ♦ u - -X (58) 

u 

H + $ t | - 0 (59) 


From (57) , (58) , and (59) the control variable as a function of terminal 
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time is 


u - - (t 2 + 2) 1/2 j (60) 

Using the dynamic equation (52) the relation between the initial state 
and terminal time is 


x 


o 


1 + t. 





(61) 


Choosing x q and solving for t^ the performance index can be evaluated 
using (51) and (60) as 


J » 2 + t 2 - t f (t 2 + 2) 1/2 


(62) 


The optimal paths for different initial conditions are plotted in 
Fig. 1. Note that the control variable along each of these paths is different. 
These paths terminate on the parabolic manifold in x,t space. Lines of 
constant time-to-go (dashed lines) and constant performance index (dashed 
dotted lines) are super-imposed on the trajectories. Suppose that the con- 
trol scheme of the last section is applied to this problem. Consider that 
the path emanating from x * .4, t = 0 is the nominal path. Now at t ** 

.32 the state x = .3 is measured. If present time is used as the index 

N 

time, then the nominal value of the state would be x = .7. As t approaches 
t = .5, the nominal path meeds the terminal manifold. From this point on, 
the control on the neighboring path is not defined since no gains are cal- 
culated beyond this point. However, if the index time is chosen so that 
the time-to-go to the terminal boundary for both the neighboring path and 
nominal path are the same, the gain tables of the nominal are entered when 
x^ ■ .4 and t = 0. As time progresses, both paths reach the terminal 
boundary together. 


- 13 - 


Note that if two paths start at the same position but at different 
times the optimal controls for each resulting optimum path are not the same. 
This difference in time between the present time and index time for non- 
stationary problems must explicitly be Included in the calculation of the 
control deviation as given by (10) . 

The control gains for calculating the perturbed control and predicting 
the error in the difference of the times-to-go are easily found from the 
scheme of the last section and are given here as 


N N 
u +t f 


N N . v . N N n 3 
_u (t f -t T ) - (u +t f r 


[x(t) - x N (t x )] + 


N . . N.2 

u tj+ (t f ) 


I N/.N v \ t N. «.Nv 2 
Lu (t f -tj) - (u +t f ) . 


(t-t x ) (63) 


u(t) - u N (tj) - 


N 


-u 


N, N N , N, Nv 2 
u (t f -tj) - (u +t f ) 


[x(t)-x N (t I )] + 


, N* 2 


u N (t«- tl )-(u N + t«) 2 


(t-tj) 


(64) 


7. NUMERICAL EXAMPLE: 

GUIDANCE SCHEME FOR A RE-ENTRY GLIDER 

The control scheme is applied here to the problem of guiding a lifting 
re-entry vehicle at supercircular velocities from the bottom of the pull-up 
maneuver (nominal altitude 188,000 ft., nominal velocity 33,000 ft. sec. \ 
nominal flight path angle -0.1°) to an altitude of 220,000 ft. at zero flight 
path angle while maximizing the terminal velocity.* This minimizes the energy 
loss at the terminal altitude. The nomenclature for this problem is given in 


'This starting point was chosen to save computer time. Usually control 
begins at entry into the atmosphere (altitude - 400,000 ft.) as in Ref. . 

Note that from the entry point the performance index, velocity, is not 
monotonically decreasing . 
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Fig. 2. The aerodynamic forces, lift and drag, are varied through the con- 
trol variable, angle-of-attack a(t). The wing loading of the vehicle, 

_2 

mg Q /S, was taken as 61.3 lb. ft. . The 1956 ARDC standard atmosphere 
model was used. The lift-drag characteristics of the vehicle are shown 
in Fig. 3. The non-linear equations of motion of a point mass about a 
spherical non-rotating Earth are given in Appendix B. The state variable 
histories for the nominal optimal path are given in Fig. 4. 

This problem is stationary and as such is Independent of the start- 
ing time. Therefore, the nominal may be thought of as shifted in time so 
that the terminal time of the nominal is the same as that of the neighbor- 
ing path. This again causes the time-to-go on both paths to be the same. 

The gains for the continuous in-flight prediction of the error in time-to- 
go is given by K^, K^, tablulated in Fig. 4 where 

e - K^(t I )[v(t)-V N (t I )] + K^(t I )[ Y (t)- Y N (t I )] + K^t^lMO-h^tj)] (65) 

The index time, t^, is found so that e ■ 0 by the procedure of Sect. 5. 

Once tj is found the perturbed value of angle-of-attack is calculated 
using the feedback gains K®, K®, tabulated in Fig. 4 as 

a(t) - a N (t x ) - K^t^CvCt) - V N (t I )] + K“( tj ) [y (t)- N (t I >] 

+ K^( tl )[h(t) - h 11 ^)] (66) 

The first-order necessary conditions and coefficient matrices for the sweep 
equations for this problem are given in Appendix B. 

The feedback gains tabulated in Fig. 4 are seen to be positive at 
the beginning of the flight and then go to negative values. This same 
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behavior occurs in guiding a vehicle to a fixed point along the minimum 
distance path on the surface of a sphere (see Appendix C) . Positive feed- 
back for the re-entry problem is required to maximize the velocity whereas 
negative feedback is required to meet the terminal constraint. Near the 
end of the flight the guidance scheme concentrates on meeting the terminal 
constraints, whereas with time-to-go large, it concentrates on maximizing 
terminal velocity. 

The modified control scheme which uses time-to-go all along the path 

is compared with the control scheme which uses present time as the Index 

time (this Is the same control scheme used In Ref. [l]). The control schemes 

are tested by introducing variations first in the altitude and then in the 

velocity. Figure 5 shows trajectories in altitude-velocity space generated 

by the modified control scheme for inltlel altitudes between 176,000 ft. 

and 216,000 ft. all with initial velocity - 33,100 ft. sec.”*’ and initial 
o 

y ■ -0.10. Plotted also on this chart are constant time-to-go curves. For 
paths resulting from positive initial altitude perturbations, the estimated 
time-to-go is shorter than the nominal time-to-go. Thus, the gains used 
were larger negative values for this index time than they would have been 
if nominal time-to-go had been used to enter the gain tables. For trajectories 
initialized by h » 216,000 ft. the path reaches the terminal point in less 
than half the nominal time. Prediction of the perturbed terminal time was 
very good. The angle-of-attack programs for two paths with large initial 
altitude variations are shown with the nominal o-program in Fig. 6. In 
the altitude-velocity space of Fig. 7, a family of trajectories generated 
by the modified control scheme are shown for initial velocities between 
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23,000 ft. /sec. and 46,000 ft. /sec. 

The errors in final altitude when the flight path angle is zero 
are shown in Fig. 8 for variations in initial altitude and in Fig. 9 for 
variations in initial velocity. For the scheme based on clock time large 
errors are incurred; for <5h(t Q ) ■ 16,000 ft., the terminal altitude error 
is 1400 ft. (Fig. 7) and for <Sv(t Q ) - 6,000 ft. /sec. the terminal altitude 
error is 2400 ft. For the range of initial altitudes and velocities given 
earlier for the modified scheme, Fig. 8 indicates the terminal altitude 
error is less than 3 ft. and Fig. 9 also indicates small altitude errors 
except for large initial variations in velocity such as 13,000 ft. /sec. 
where the error is 93 ft. 

Not only does the modified scheme meet the terminal constraints on 
y(t^) and h(t^) better but it also achieves greater terminal velocity. 

A comparison in terminal velocity between the control scheme based on clock 
time, the modified control scheme and the first-order approximation relating 
changes in the terminal velocity to changes in the initial conditions (the 
adjoint variable) is shown in Fig. 10 for variations in Initial altitude 
and in Fig. 11 for variations in initial velocity. The greatest improvement 
made by the modified control is for 6h(t ) <0 and <5v(t ) < 0. It is 
for these variations that the scheme based on clock time "runs" out of 
gains and uses the largest a available (± 60 DEG). However, the modified 
scheme does better in maximizing the velocity for all variations. Note 
that the terminal velocity of the modified scheme lies very close to the 
first-order approximation. 
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8. COMMENTS ON EXTENSIONS OF THE NUMERICAL EXAMPLE 


If the guidance scheme is initialized at entry into the atmosphere, 
the velocity first increases and then decreases. This makes it impossible 
to use velocity-to-go as an index variable as proposed in Ref. 4 . However, 
in the region where velocity is monotonically decreasing the number of 
gains can be reduced by one if velocity-to-go is used as the index variable 
instead of time. The dynamics are velocity dependent; i.e., non-stationary 
with respect to velocity. The gains f. and r of (45) and (50) are non- 
zero and can be easily calculated from the now 2-vectors, m, f and A. 

The gains r and Jt would be also non-zero if the boundary conditions 
were explicit functions of time. This would occur if the terminal positions 
were constrained and the rotation of the Earth Included. 

9. CONCLUSIONS 

The performance of the re-entry guidance scheme demonstrates that 
using time-to-go as the index variable over the entire path increases greatly 
the range of possible initial conditions while still meeting the terminal 
conditions and achieving the maximum value of performance. The appropriate 
gain to choose depends upon the estimated flight time to the terminal boundary, 
not the clock time. 

The range of initial conditions handled is so great that, in general, 
only one reference nominal would have to be stored on an on-board computer 
to guide a re-entry vehicle. Even further savings in storage may be gained 
by storing only the coefficients of a polynomial fit to the reference path 
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and gains. For gains that have singularities at the terminal point it is 
suggested the polynomial also have a singular term proportional to some 
power of (t f -t) -1 . 
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APPENDIX A 


Feedback Gains on (t-t^) 

A geometrical interpretation of the sweep variables is given through 
a Hamilton-Jacobi viewpoint as presented in Ref. [ 2 ]. An optimal return 


function, V(x,t,p), is defined as the value of J of (15) when starting 
from x at time t je t^ keeping « 0 along the path and meeting 


terminal constraints 

4 > 

with £2 ■ 0. For small variations in 

x , ip and 0 

the sweep and adjoints variables can be interpreted in terms of 
return function as 

the optimal 

x T (t) 

- 

V x(t) 

(A-l) 

S(t) 

m 

V x (t)x(t) 

(A-2) 

R(t) 

m 

%X (t) 

(A-3) 

Q(t) 

m 

v *1> 

(A-4) 

m(t) 

s 

Vfix( t) 

(A-5) 

n (t) 

m 


(A-6) 

o(t) 

m 

V £in 

(A- 7) 


The partials are Implied by the expansions of (36). 

Using the above identifications the explicit dependence of the gain 
r in (50) on time is illustrated. Also an alternative scheme for calculating 
r is given where u^ need not be calculated by numerical differentiation 
on the computer. The expression for A given by (47) in terms of the return 
function is 
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A 


H 1 [V f + f T V ] 
UU L X ux u XX J 


(A-8) 


From ll y ■ 0 the value of u along an optimal path is 


u - -H" 1 [Vf f + Vf + ■“ (V )f 1 

uu L x ux x ut dt x u J 


where 



V f 


xx 


+ V 


xt 


(A-9) 


(A-10) 


Note that the order of total and partial differentiation is important. 
Introducing (A-8) and (A-9) into the expression for (50) gives 

r - H _1 [v f + V f ] - H _1 [V f 1 

UU L X ut tx U J UU L X U J t 


(A-ll) 


If V f is not explicitly dependent upon time r ■ 0 and the neighboring 
X u 

control program is stationary. Note that u of (A-9) is easily calculated 
by using 



-V f 


x x 


(A-12) 


where V x is identified as the adjoint in (A-l) . 

The dependency of the gain Z on t is demonstrated directly once 
<5x ** dx - fdt is used in (36) . The change in the terminal time is then 
related to a change in the present time as 


dt^ « m*dx - m^fdt + d\p (A-13) 

A T 

The term m f is seen to be -9t^/8t keeping x,i|» and ft constant. Then 
Z defined by (45) is 
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8 - 


1 


(A-14 ) 



3 t 


A stationary problem depends only on the time interval and not on a parti- 
cular value. Thus if the problem is begun a time dt late, it will finish 
a time dt late. That is 3t f /3t = 1 which in turn implies that 8 = 0 
for stationary problems. To calculate 8, in terms of partials of the return 
function with respect to t, time may be regarded as a state variable and 
the index time tj be the independent variable. Since the problem is not 
explicitly dependent on the new variable t^, the indexing variable t^ 
is found by entering the gain tables so that the predicted value of the 
change in the terminal value of t^ is zero. This is equivalent to nulling e. 
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APPENDIX B 


Equations Used In the Atmospheric Re-entry Problem 

The equations of motion for a re-entry vehicle about a non-rotating 
spherical Earth are 


V - - 


v v2s 

— - 8 * lnT 


(B-l) 


C L pVS V A 
2m + R+h “ V C0SY 


V siny 


(B-2) 

(B-3) 


where V, y, h is the state vector and t ■ time is the independent variable. 
The variational Hamiltonian is 


H - -X 


c »V 2 S 


[ C L pVS + 

JL _ &\ C08Y 

an + 8 ,lnT 

T A 

Y 

2m + 

R+h Vj C08Y 


(B-4) 


+ X h [v siny] 

The Euler -Lagrange equations are 


C p vs 

- H v “ — X v " 


X - -H - g COSY x + 

y y v 


c lP s 

2m 


_V &. 

R+h V 


_i_ + JL 

R+h y 2 


COSy 


X - sinY X h (B-5) 
Y 


sinY X - V cosy X h 


X h " 


- H h" 



, y . C / S J 

Y i 

v 2g ] 

m 

" R+h 1 Y 2m 3h 

A T 
V 

(c*M ,> 2 V(R+h >J 


C T VS . 

_ L 3o 

2m 3h 


(B-6) 


C98Y 


(B-7) 
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The partials of the Hamiltonian needed to evaluate the matrices A, B, and 
C of (33), (34), and (35) are 



(B-8) 


H 

Vy 




siny + X^ cosy 


(B-9) 


C.S . 

L 3p 

2m 3h 


(R+h ) ‘ 


V (R+h) 


(B-10) 



(B-ll) 

(B-12) 

(B-13) 

(B-14) 

(B-15) 

(B-16) 

(B-17) 

(B-18) 



3C 

h -£2S 

X a 2m 3a 
Y 


H, - 0 
X h a 


H 


X V 
v 


v vs 

m 


H 


V s 

XV* m 
Y 


R+h 2) 


cosy 


> f v 


H - siny 

V 


H X Y " "8 cos Y 
v T 


H 


X Y 
Y 


JL _ S.1 

R+h Vj 


cosy 




H. - V cosy 

V 


u . _ C D V S i£.+ 2g_sinx 

Ih 2m 3h R + h 

V 


C L VS 3p 

H X h - ST 3h - 

Y 


(R+h) 


2 V (R+h) 


sfcr) C0SY f f h 


V " 0 


(B-19) 

(B-20) 

(B-21) 

(B-22) 

(B-23) 

(B-24) 

(B-25) 

(B-26) 

(B-27) 

(B-28) 

(B-29) 
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APPENDIX C 


Character of the Feedback Control Gains 

The feedback control gains for the optimal guidance scheme of a 
re-entry vehicle have positive values at the beginning of the flight and 
become negative toward the end of the flight (see Fig. 4). The same behavior 
is found for the control gains in guiding a vehicle along the shortest 
path to a fixed point on a sphere (Ref. [ 5 ]). Spherical coordinates are 
used as defined in Fig. 12. The element of distance, ds, on the surface 
of the sphere is 

ds - r[(de) 2 + cos 2 e(d<J>) 2 ] 1/2 (C-l) 


where r 


where 


radius of the sphere. 

.♦i 2 

[u + cos 


The problem is to find 

V) 1/2 d* 


J 

O 


u (4>) 


to minimize 

(C-2) 


9(0) = 0 , eot^) = 0 

The method of solution for this problem will be to solve the accessory minimum 
problem using the Riccati equation approach given in the text. Here will 
be the independent variable. 

It is straight forward to show that u(<f>) *» 0 , 0 (<p ) * 0 satisfies 
the first-order necessary conditions. Let us consider neighboring paths 
by expanding the performance index (C-2) to second order as 
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Here u and 0 are deviations away from the nominal u « 0 , $ ■ 0. The 
Hamiltonian for the accessory minimum problem is 


H ■ |[u 2 - 8 2 ] + Xu (C-A) 

and the Euler-Lagrange equations are 

- - H e • x( *i> ■ v <c - 5) 

H - 0 u * -X (C-6) 

u 

where v is a Lagrange multiplier on the terminal constraint 0(4^) ■ 0. 

The Riccati transformation of (36) is used to relate X to 0 in 


developing the feedback law. Here for f - 0 , f u ■ 1 , H Qe » -1 , 

H a - 0 , H ■» +1, the sweep variables are determined from (32) 

0u uu r 

- S 2 + 1 ; S(^) - 0 (C-7) 

|| - SR ; R(* x ) - 1 (C-8) 

|| - R 2 ; Q(4> 1 ) - 0 (C-9) 

The solutions in the interval 0 _< 4> 1 - <J> ir/2 are easily obtained: 

S *» -tan((j)^-<|») (C-10) 

R m sec(<f>^-$) (C-ll) 

Q ■ -fcan(<t>j-<|>) (C-12) 


Note that as ^ - <f> + ff/2, S and Q go to «. This value of S corresponds 
to a problem where is fixed but 8 Of^) is free. S ■ “ corresponds to 

a conjugate point for the unconstrained problem. If S is transformed fto 


(C-13) 


that it relates to the fixed end point problem, 0(4)^) - 0, 

S « S - RQ “ ctn[(^ - 4 ] 

S, R, and Q do not exist for > tt/2, but S (propagated also by 

(C-7)) exists in the interval 0 <_ - 4 <_ it where at (<^- 4 ) » 0, 

S -«*>. Note S -*• «*> as - <J> -*• it, so the latter is a conjugate point 
for the constrained problem. From the point $ ■ 0 , 0 ■ 0 to <f> «• it, 

0*0 there are an infinite number of great circles which all give the 
same value of the performance index. Using the Riccati transform and (06) 
the neighboring optimal feedback law is 

u - - §6 . (014) 

The feedback gain, -S(4>), is negative between 0 <_ - <f> £ it/ 2 and 

positive between ir/2 <. _< it. For values of > tt the nominal 

path is not a minimizing path. 

Consider a variation from the equatorial nominal path to the point 
A shown in Fig. 12. The optimum path from A to the terminal point 
(6 ■ 0 , 4 ■ it) diverges from the nominal path until <f> « tt/ 2, then 
converges to the nominal path as <f> -*■ tt. Consider an airplane flying 
eastward along the equator to a destination on the equator. If a disturbance 
moves the airplane north of the equator when there is more than 90° to go 
to its destination, it should change its flight path to slightly north of 
east (positive feedback). If there were less than 90° to go, it should 
change its flight path to slightly south of east (negative feedback) . Using 
the feedback law of (C-14) , a control is found initially which forces the 
neighboring path further from the nominal path. The slope of the neighboring 
path in the direction away from the nominal is an effort to minimize the 
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performance index. This corresponds to positive feedback in (C-14) . 

However, after the median <t> - ir/2 is reached, the slope of the neighboring 
path is directed toward the nominal in an effort to meet the terminal con- 
straint. 

Observing that ■ -1 in (C-3) , it seems possible that increasing 

2 

0 might minimize 6 J. In fact, a necessary condition for the existence 
of a conjugate point is that H ao < 0. 

The following similarities between this simple problem and the re- 
entry problem are: (a) The feedback gains tabulated in Fig. 4 become 

positive at the beginning of the flight; (b) -C - H - H H -1 H of (32) 

° xx xu uu ux 

was not semi-negative definite [5] (here the performance index is to be 
maximized) , and (c) the sweep variable S became unbounded along the optimal 
trajectory for the constrained problem. The latter means that along this 
path there is a conjugate point for the problem maximizing velocity with no 
terminal constraints. These same things also occurred for the nominal 
trajectory of Ref. [l] . 
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FIG. 1 TRAJECTORIES IN PURSUIT PROBLEM 
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FIG. 2 GEOMETRY AND NOMENCLATURE OF ATMOSPHERIC 
RE-ENTRY EXAMPLE PROBLEM. 
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FIG. 3 LIFT-DRAG POLAR FOR RE-ENTRY VEHICLE 
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FIG. 4 TABLE OF STATE VARIABLE HISTORIES FOR THE NOMINAL MAXIMUM VELOCITY PATH, GAINS 
FOR INFLIGHT PREDICTION OF THE ERROR IN TIME-TO-GO, AND FEEDBACK GAINS. 
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FIG. 5 TRAJECTORIES GENERATED BY MODIFIED NEIGHBORING 
OPTIMUM CONTROL SCHEME DUE TO INITIAL ALTITUDE 
PERTURBATIONS. 
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FIG. 6 ANGLE -OF- ATTACK PROGRAMS FOR NOMINAL PATH 

OF A RE-ENTRY VEHICLE AND TWO PERTURBED PATHS. 
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FIG 9 TERMINAL ERROR IN ALTITUDE DUE TO INITIAL VELOCITY PERTURBATIONS FOR 
CONTROL SCHEME BASED ON TIME-TO-GO AND CLOCK TIME. 





FIG. II TERMINAL VELOCITY COMPARISON OF CONTROL SCHEMES BASED ON TIME-TO-GO AND CLOCK 
TIME FOR INITIAL VELOCITY PERTURBATIONS. 
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